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1 Leading up to the results 

Singular points of random matrix- valued analytic functions are a common gen- 
eralization of eigenvalues of random matrices and zeros of random polynomials. 
The setting is that we have an analytic function of z taking values in the space of 
n x n matrices. Singular points are those (random) z where the matrix becomes 
singular, that is, the zeros of the determinant. This notion was introduced in the 
Ph.D thesis iflOl of the author, where some basic facts were found. Of course, 
singular points are just the zeros of the (random analytic function) determinant, 
so in what sense is this concept novel? 

In case of random matrices as well as random analytic functions, the following 
features may be observed. 

1. For very special models, usually with independent Gaussian coefficients or 
entries, one may solve exactly for the distribution of zeros or eigenvalues. 

2. For more general models with independent coefficients or entries, under 
rather weak assumptions on moments, one can usually analyze the empirical 
measure of eigenvalues or zeros as the size of the matrix increases or the 
degree of the polynomial goes to infinity. 

3. Substituting independence with assumptions of particular kinds of symme- 
try and dependence of entries or coefficients, eigenvalues and zeros have 
been studied with varying degrees of success. 
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The point here is that the determinant of a random matrix- valued analytic function 
has coefficients that are dependent in a very complicated way and one might not 
expect it to be tractable. Nevertheless, 

• In this paper we demonstrate that certain special models of random matrix- 
valued analytic functions based on independent complex Gaussians have 
singular sets that turn out to be determinantal point processes in the two 
dimensional sphere or the hyperbolic plane! This is the exactly solvable 
situation. 

• In a subsequent paper dU, we shall study asymptotics of the counting mea- 
sure on the singular set of random matrix-valued analytic functions, under 
weak assumptions of independence (and some moment conditions), as the 
matrix size goes to infinity. This will be a generalization of the circular law 
for non-Hermitian random matrices with independent entries. 

• Numerous questions suggest themselves, taking unitary or Hermitian ma- 
trix coefficients in a random polynomial, for example. Answers are yet to 
suggest themselves. 

Despite its natural appeal, the concept of a random matrix- valued analytic function 
does not seem to have been considered in the literature, perhaps because the focus 
in random matrix theory has been mostly on eigenvalues in one dimension (real 
line or the circle). We do not know a way to force singular points to lie on the line 
(except the case of eigenvalues of Hermitian matrices). 

A necessary notion needed to even state some of our results is that of a deter- 
minantal point process, first defined by Macchi ffTTTl . For the reader not familiar 
with them, a brief introduction to determinantal processes is given in the Ap- 
pendix. This is sufficient for the purposes of this paper, but to know more, the 
reader may consult the surveys [16J or [6J. The reader interested merely in our 
results and proofs may now jump directly to the next section. The rest of this 
section is devoted to motivating the results and establishing the context and is not 
logically necessary to read the rest of the paper. First some notations. 

Notations: D is the unit disk in the complex plane. The two-dimensional sphere 
S 2 will always be identified with C U {°°} via stereographic projection, m denotes 
Lebesgue measure, g and gj, i > 0, are always independent standard complex 
Gaussian random variables, with density n~ l exp{ — |z| 2 } in the plane. G and G,, 
i > 0, are n x n matrices whose entries are i.i.d. standard complex Gaussians. The 
group of permutations of a set T will be denoted by S (T). When T = {1,2, . . . ,k} 
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we just write Sk- We denote the set { 1 , 2, ... , k} by [k] . And 11 (N) is the group of 
N x N unitary matrices. 

The results of this paper were motivated by two well known results, one from 
the realm of random matrices and another concerning zeros of random analytic 
functions. The first is due to Ginibre [5], who found the exact distribution of 
eigenvalues of a random matrix with independent standard complex Gaussian en- 
tries and the second is due to Peres and Virag [fT3l who discovered the exact dis- 
tribution of zeros of the random power series with independent standard complex 
Gaussian coefficients. 

Result 1 (Ginibre(1965)). The set of eigenvalues of the nxn matrix G with i.i.d. 
standard complex Gaussian entries is a determinantal point process with kernel 

k=o K - 

with respect to the reference measure d/j(z) = \ e ~^ ' ■ The corresponding Hilbert 
space H = span{ 1 , z, . . . , z n ~ 1 } C L 2 (C, dm{z) ) . 

Result 2 (Peres and Virag(2003)). Let f(z) = g + g\z + giz 1 + ... be the ran- 
dom analytic function (the radius of convergence is 1) whose coefficients gj are 
i.i.d. standard complex Gaussians. Then the zeros of f form a determinantal point 
process on the unit disk D with the kernel (the Bergman kernel of the unit disk) 



K(z,w) 
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with respect to the background measure d/u(z) = ^dm(z) on D. The corresponding 
Hilbert space H = span{l,z,z 2 , . . .} C L 2 (D, dm ^ ) is the space of all analytic 
functions in L 2 (D). 

Knowing that a point process is determinantal greatly facilitates studying its 
properties. This motivates us to ask whether these two are isolated results or 
whether they are part of a bigger picture. As a start, let us draw a list of random 
analytic functions whose zero sets are known to be determinantal. 

• z — g = : One zero, g, with standard complex Gaussian distribution on C. 
H = span{l} in L 2 (C, s -^-dm(z)) . Determinantal, of course! 
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• Zgi — g2 = : One zero, that has density ^j^rpm dm(z) . This is just the 

push-forward of the uniform measure on S 2 = C U {°°} under stereographic 
projection. Again, a one-point process is determinantal. In this case, H = 
span{ 1 } in L 2 (C U {00} , —J^dm(z) ) . 

• go + zgi + z 2 g2 + • • • = 0. This is the i.i.d. power series of Theorem[2l 

Our key observation is that Ginibre's result (Result Q]) describes the law of ze- 
ros of the random analytic function det(z/ — G), which may in turn be thought of 
as the matrix version of the analytic function z — g, the first of the three exam- 
ples above. This suggests that we consider the matrix versions of the second and 
third examples. This leads us to two families of random matrix-valued analytic 
functions. 

1. zG\ — G2, where G\, G2 are n x n independent matrices with i.i.d. standard 
complex Gaussian entries. 

2. Go + zG\ + z 2 G2 + where Gk are independent n x n matrices with each 
Gk having i.i.d. standard complex Gaussian entries. 

The analogy with Ginibre's result strongly suggests that the singular points of 
these matrix- valued analytic functions might be determinantal point processes. 
But how to guess which determinantal processes? 

Invariance to the rescue: The key feature that allows us to guess which deter- 
minantal processes, is invariance under a large group of transformations. A point 
process X on the space Q. is said to be invariant (in distribution) under a transfor- 
mation T : Q. -> Q. if T(x )= X . 

• First consider the matrix-valued analytic function zG\ — G2. We claim that 
the singular points (which are just the eigenvalues of G\ l G2) are invariant 
under all linear fractional transformations 

with a, P complex numbers satisfying |oc| 2 -f||3| 2 = l. These are precisely 
rotations of the two-dimensional sphere, when the sphere is identified with 
C U {00} via stereographic projection. 
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To see this, let a, (3 be complex numbers such that |a| 2 + |(3| 2 = l. Then 
define 

C = aGi- pG 2 and D = pGi + aG 2 . 

Then (C,D) has the same distribution as (Gi,G2). This implies that the 
solutions to det(zC — D) = have the same distribution as the solutions to 
det(zGi — G2). On the other hand, 

det(zC-D) = det((za-(3)Gi-(zP-a)G 2 ) 

Thus the zeros of det(zC — D) are precisely j where {X,} are the 

zeros of det(zGi — G2). Thus we have 

r J l<i<n 

for any a, P with |a| 2 + |(3| 2 = 1 which is what we claimed. 

00 

Next consider the matrix- valued analytic function M(z) '■= L G^z where 

k=0 

Gk are independent random matrices with i.i.d. standard complex Gaussian 
entries. We claim that the set of singular points is invariant in distribution 
under the isometries of the hyperbolic plane, namely the linear fractional 
transformations 

v(*) = 2r^ M 2 -IPl 2 = i, 

that map the unit disk injectively onto itself. These are precisely the con- 
formal automorphisms of the unit disk. To see this, observe that for each 
1 < h j < n, 

M u (z) = Go(i, j) + Gi (i, j)z + G 2 (/, j)z 2 + ... 

is a copy i.i.d. power series of Theorem [2l and the M ; ; (-)s are themselves 
independent random functions. It is known (see [15]) that 



M^(90) = 9TrHy(-)- (3) 
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This is because M\ t \ is the Gaussian element of the Hilbert space of an- 
alytic functions with the boundary inner product Jq 71 f(e lQ )g(e l& )^ (We 
define this inner product for analytic functions that extend continuously 
to the boundary, and take the completion). On this Hilbert space, / — > 

( P'(0 _2 /( ( P(')) * s a un i tar y transformation. When a unitary transformation 
is applied to the Gaussian element of the Hilbert space, we get again the 
Gaussian element of the Hilbert space, yielding © (of course, the Gaussian 
element is itself not an element of the Hilbert space, almost surely). 

An alternate way is to just check that the centered Gaussian processes on 

the two sides of © have the same co variance kernel * == . The reader 

l-<P(z)<P(w) 

may refer to the paper of Sodin and Tsirelson [15] for a more detailed 
proof. Anyhow, the independence of distinct My and the fact that (p' is 
non-random shows that 

det(M(cp(-))) = cp / (r ? det(M(-)). 

Now, cp' is a nowhere vanishing analytic function on the unit disk. There- 
fore the above equation shows that the singular set of M(-) is invariant in 
distribution under the action of hyperbolic isometries. 

These are two special cases of a large class of invariant zero sets introduced in 
IfTOl . The general situation is that one applies a homogeneous polynomial of sev- 
eral complex variables ("det" in our case) to a bunch of i.i.d. copies of a Gaus- 
sian analytic function (zgi — g2 or go + zgi + z 2 g2 + ■ ■ ■ in the two cases). If the 
individual Gaussian analytic functions have invariant zero sets, then so will the 
homogeneous polynomial of copies of them. The idea is that constructions (ap- 
plying a homogeneous polynomial to i.i.d. copies of an analytic function) which 
are simple in terms of functions, are not simple at all at the level of zeros, and may 
give something drastically new. 

Let us return to our original question which led to a digression into the issue of 
invariance. This was the question of guessing what determinantal processes might 
the singular sets of the two families of random matrix- analytic functions (zG\ — G2 
and Go + zG\ + z 2 G2 + ■ ■ ■) be. We now have obtained a strong restriction: the de- 
terminantal process better be isometry-invariant in § 2 or D, respectively. Further, 
from the fact that we are looking at zeros of analytic functions, we expect that 
these determinantal processes are defined by Hilbert spaces of analytic functions. 
Such determinantal processes were classified in [ 10] (see Theorem 3.0.5 therein). 
From this classification, we get the following invariant determinantal processes as 
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the only possible candidates. These processes were, in fact, studied first by Cail- 
lol [QQl under the name "one component plasma on the sphere" and by Jancovici 
and Tellez on the hyperbolic plane. 

1 . On the sphere (C U {°°}) we have for each n 6 {1, 2, . . .} an invariant deter- 
minantal point process with kernel 

K(z,w) = (\+zw) n - 1 

with respect to the background measure d/u„(z) = n ttW+i dm{z). Invari- 
ance of the point process under an analytic transformation T is equivalent 
to saying that the joint intensities (correlation functions) with respect to 
Lebesgue measure satisfy 

p k (Tz h ...,Tz k )\T'(zi)\ 2 ...\T'(zk)\ 2 = Pk(z h ...,z k ). 
For the case at hand, this is easily checked from the fact 



T ; {z)T'(w) _ 1 



(\ + T(z)T(w)) 2 (l+zw) 2 ' 

The parameter n is the total number of points in the point process, or equiv- 
alently, it denotes the first intensity of the point process with respect to the 
spherical area measure , 1+ | z pp - 

2. On the unit disk, we have for each n > 0, an invariant determinantal point 
process with kernel 

K(z,w) 1 



(\-zw) n+1 

with respect to the background measure d/j n (z) = | ( 1 — | z | 2 ) n ~ 1 dm (z) . Again 
it is easy to check that these are invariant, now using 



<pWW i 



(l-(p(z)cp(w)) 2 (l-zw 

The parameter n is denotes the first intensity of the point process with re- 
spect to the hyperbolic measure n r l \ z fl\i ■ 

Thus, on each of the sphere and the disk, we have a family of invariant singular 
sets and a family of invariant determinantal processes. Then by comparing the first 
intensities of these determinantal processes and the set of singular points of our 
matrix- analytic functions, we match the singular sets to determinantal processes. 
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2 Statements of results 



We now state our results. 

Theorem 3. Let G\,G2 be i.i.d. n x n matrices with i.i.d. standard complex 
Gaussian entries. The zeros ofdeX(zG\ — G2) form a determinantal point process 
on S 2 with kernel 

K(z,w) = (l+zw) n - 1 

with respect to the background measure d/j n (z) = f (i+M2v»+i • Equivalently, we 
may say that the defining Hilbert space is the subspace of analytic functions in 
L 2 (CU{oo} 5A , n ). 

Theorem 4. Let Gk be i.i.d. nxn matrices with i.i.d. standard complex Gaussian 
entries. Then for each n > 1, the zeros of det (Go + zG\ +z 2 G2 + ■ ■ ■) form a 
determinantal point process on D with kernel 

K(z,w) 



(l-zw) n+1 

with respect to the background measure dp. n (z) = ^(1 — \z\ ) n dm(z). Equiv- 
alently, we may say that the defining Hilbert space is the subspace of analytic 
functions in L 2 (©,//„). 

Theorem [3] is proved section [3] via the Schur decomposition of the matrix 
G7 G2, along the lines of Ginibre's proof of Theorem [Q Theorem @] will be 
proved in section \5\ as a corollary of the following more general theorem which 
appears to be of potential interest beyond the specific application to Theorem HI 
Theorem [5] is proved in section HI 

Theorem 5. Let Am be nxn matrices such that VNAn — > Xofor some random 
matrix Xq. Independently of A, pick P, Q independent matrices chosen from Haar 
measure on U (N) and define the N x N matrix V by 



V = Q* 



A N 
In-u 



P*. (4) 



Set fyv(z) := d^/^A * ) ■ Let Xq and Gi, i > 1, be independent random matrices, 
where Gi have independent standard complex Gaussian entries. Then 



N n /%(z)^dct(x +Y,G k A 
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in the sense that any finite set of coefficients in the power series expansion of 
converge jointly in distribution to the corresponding vector of coeffi- 
cients in the power series expansion of the right hand side. 

The relevance of this theorem to Theorem |4] is through a result of Zyczkowski 
and Sommers IfTTl who found random matrix models whose eigenvalue distribu- 
tions are determinantal processes with kernels that are truncated versions of the 
kernels in Theorem HI Theorem [5] gives the distribution of the limiting random 
analytic function, as the matrix size increases, while the result of Zyczkowski and 
Sommers gives the limiting distribution of zeros. Putting the two together we get 
the distribution of zeros of the limiting random analytic function. 

Remark 6. The statements of Theorem [3] and Theorem 0] may already be found 
in the thesis [10]. Theorem [3] appeared there with a proof but is being published 
here for the first time. Theorem |4] was conjectured in ifTOl and a partial proof was 
given, showing that the first and second joint intensities (correlation functions) of 
the singular set of Go + zG\ + z 2 G% + ■ ■■ are as claimed. While in IfTOl we tried 
to prove Theorem H] by starting with the random matrix- valued analytic function 
and then finding the distribution of its zeros, in contrast, in this paper we take the 
opposite direction. 

Remark 7. It is natural to ask whether there are other (perhaps even many) sin- 
gular sets that are also determinantal. Without claiming that there are not, we 
would like to emphasize that the determinantal processes in Theorem \T\ together 
with those in Theorems 13141 are the most natural determinantal point processes in 
the three canonical surfaces of constant curvature, namely, the plane, the sphere 
and the hyperbolic plane, respectively. As remarked earlier, these processes were 
studied first by Caillol [Q]| on the sphere and by Jancovici and Tellez Q on the 
hyperbolic plane (related "two-component plasmas" were studied by Forrester, 
Jancovici and Madore flU). 

Independently, in ifTOll . motivated by an analogous theorem of Sodin lfl4l for 
zeros of Gaussian analytic functions, it was proved that on each of these three 
domains, there is exactly a one parameter family of invariant determinantal point 
processes that arise from Hilbert spaces of analytic functions. There are some 
additional conditions, see Theorem 3.0.5 in IfTOll for precise statements. These 
determinantal processes are exactly those that correspond to Hilbert spaces of 
analytic functions on these domains with respect to the following measures. 

• dna{z) = ^e~ a \ z \ 2 dm(z), a > in the plane. In this case, changing a merely 
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has the effect of scaling the plane, and therefore these determinantal pro- 
cesses should be thought of as identical. 

• dna{z) = rc (1+ |" 2 )a+i <Mz) for a G { 1 , 2, 3, . . .} in the sphere (C U {°o}). 

• d/j a (z) = |(1 — \z\ 2 ) a ~ l dm(z) for a > in the unit disk. 

The determinantal processes appearing in Theorems 13 141 are precisely these canon- 
ical ones, while the determinantal processes in Ginibre's theorem converge (as the 
matrix size increases) to the canonical determinantal point process on the plane. 
The upshot of all this is that determinantal singular sets are somewhat special and 
may not be all that abundant. 

Note that while canonical determinantal processes in the unit disk exist for 
every a > 0, Theorem[4]gives a singular-set interpretation only for positive integer 
values of a. 



3 Spherical ensembles 

Let X denote the set of singular points of zG\ — G%. Since the number of points 
is exactly n, Theorem[3]is equivalent (see the facts stated after definition [FT] in the 
appendix) to saying that the joint density of the singular points is proportional to 

iA(zi,...,z„)i 2 n (1+ |^ 2)B+ t- 

(If {P\, . . .,P n } are the points on the two-dimensional sphere obtained by stere- 
ographic projection of zi,---,Z n , then the density of these points with respect to 
Lebesgue measure on (§ 2 )" is simply 

where || • || R 3 is the Euclidean norm in M?). 

The following lemma will greatly simplify the job of integrating out auxiliary 
variables later. 

Lemma 8. Let X be a point process on C with n points almost surely. Assume 
that the n-point correlation function (equivalently the density) of X has the form 

p(zh...,Z n ) = | A(zi,...,z„) | 2 V(|zi| 2 ,...,|z„| 2 ). 
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Here A(zi , . . . ,z n ) denotes the Vandermonde factor Y[ (zj — Zi). 

Suppose also that X has a distribution invariant under automorphisms of the 
sphere 8 2 i.e., under the transformations cp a .p(z) = f or an y a ' P satisfying 

|a| 2 + |(3| 2 = 1. Then 

V(\Z, I 2 ,---, kn| 2 ) = QlWt ft (l + |^|2)n + l - < 5 > 

Proof of Lemma HI The claim is that the probability density of the n points of X 
(in exchangeable random order) with respect to Lebesgue measure is 
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q(zi , ■ ■ ■ , z n ) := Const. | A(zi ,...,Zn)\ 2 \[ (1 + | z J2y»+r 



it=l 



First let us check that the density g is invariant under the isometries of S 2 . For this 
let <P<x,p(z) = 5^=' witn a, P satisfying |oc| 2 + |p| 2 = 1. Then, 



<P'W = , ^ 1 _, 2 - (6) 
(-pz + a) 2 

1 + |9(^)| 2 = , ^Lti - (7) 
| -pz + a| 2 

q>W - q>M = . = *~\ w (8) 

(-pz + a)(-p\v + a) 

From ©,© and ®, it follows that 

n 

q(<?(zi), . . . , q>(z„)) n |q/(zjt) | 2 = 9 (zi, • • • ,z n ) , (9) 
fe=i 

which shows the invariance of q. 

Invariance of X means that Voc, (3 with |a| 2 + 1|3| 2 = 1, and for every z\,...,z n , 
we have (with cp = cp a r) 

n 

p (cp(zi), . . . , q>(z„)) \\ \(p'(z k )\ 2 = p(zi,...,z n )- (10) 

k=l 

Set W(z h . . . ,z„) = Then, we get 
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W(z\,...,z n ) is a function of |zi | 
and the definition of q. 



\z n \ only, by the assumption on p 



• W(q(zi ),..., <p(z„)) = W(zi,...,z„) for every zi z„ from (flOl) and (l9l). 

We claim that these two statements imply that W is a constant. To see this fix 



Zk = rke'® k , 1 < k < n, such that r\ < for k > 2. Let a 



Then |a| + |(3| = 1 and so cp a p is an isometry of § . From the above stated 
properties of V7, we deduce, 

W(zi,...,z M ) = W(q>(zi),...,<p(*0) 
= W(0, 



W 



1+Z2Z1 l+Z M Zl 

r 2 e i ^-z\ 
1 + r 2 e^zi 



1 +r n e lQ "z\ 



Take zi = 1 and 1 < r& < 1 + e. Then as 0^, 2 < k < n vary independently over 



[0,2rc], the quantities | ^ ^.l 1 I varv over tne intervals 
the left side, W(z\, • • • ,z«) does not change because W is a function of r^s only. 
By our choice of r^s, this means that 

W(0,t2,---,tn) = Constant \/t k e £, 

£ is arbitrary, hence V7(0,?2, • • -,*n) is constant. This implies that W(0,Z2, • • • ,z«) 
is constant and therefore W(zi, . . . ,z„) is constant. 

This shows that p(zi,. . . ,z«) = Const.g(zi, . . . ,z«). □ 

Proof of Theorem |3j Recall which asserts that X is invariant in distribution 
under the action of automorphisms of S 2 . By Lemma [8l it suffices to show that 
the density of points in X is of the form given in ©. We use the following well 
known matrix decomposition. 

Schur decomposition: Any diagonalizable matrix M e GL(n : C) can be written 

as 

M = U(Z+T)U\ (11) 

where U is unitary, T is strictly upper triangular and Z is diagonal. Moreover the 
decomposition is almost unique, in the following sense: 



r k + \ 
r k + V r k -l 



. However 
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M — V(W + S)V* in addition to £[]]), with V, 5, W being respectively unitary, 
strictly upper triangular, and diagonal, if and only if the entries of W are a permu- 
tation of the elements of Z, and if this permutation is identity, then V = U® and 
&S&* = T for some that is both diagonal and unitary, that is, for of the form 
Diagonal(V ei , . . . ,e' e ") . 

Corresponding to this matrix decomposition (fTTI) . Ginibre Q proved the fol- 
lowing measure decomposition. 

Ginibre's measure decomposition: If M is decomposed as in (TTTT) . with the ele- 
ments of Z in a uniformly randomly chosen order, then 

l\dm(M lj )= (Hlzi-ZjfUdmizk) ] ( Y[ dm t T u) ) dv ( U ) (12) 

hi \i<j k J \i<j J 

where v is the Haar measure on the on the unitary group U (n). 
Conditional on G\, the matrix M := G^ l G2 has the density 

e -TT(M*G\G\M) | (Jet(Gi) I 2 " 

with respect to the Lebesgue measure on GL(n,C) C C" . From the measure 
decomposition (fT2l) we get the density of Z, T, U, G\ to be 



U^-^\ 2 U dm ^)) e- Tr ^ 1+MM *» | det(G! 



i 2n 



with respect to the measure dv(U)Y[i<jdm(Tij)Y[ijdm(Gi(i, j)) (we have omit- 
ted constants entirely-they can be recovered at the end). Thus the density of Z is 
obtained by integrating over T, U, G\ . Now write Zk = r^e 1 ® 1 so that Z = &R where 
= Diagonal(e' ei , . . . , e' e ") and R = Diagonal(n, . . . , r n ). Then 

MM* = U®(R + ®*T)(R + ®*TY®*U*. 

As v is the Haar measure, dv(U®) = d\(U). The elements of ®*T are the same 
as elements of T, but multiplied by complex numbers of absolute value 1. Hence, 
0*r has the same "distribution" as T. Thus replacing U by ®*U and T by ®*T we 

see that the density of Z is of the form V(n, ■■■,r n )Y[ \zi — z,\ 2 . This is the form 

i<j 

of the density required to apply Lemma [8] Thus we conclude that the eigenvalue 
density is 

Constnizi-zjffl ^ (13) 



i<j k=l 
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To compute the constant, note that 



is an orthononnal set. Projection on the Hilbert space generated by these functions 
gives a determinantal process whose kernel is as given in the statement of the 
theorem. □ 



4 Proof of Theorem El 

We first find the coefficients in the power series expansion of fy prior to taking 
limits using the following lemma. Randomness plays no role here. 

Lemma 9. Let V be an N x N matrix and define f(z) = det(i+zv*) ' Then 

/W(0) =det(V) £ sgn(%)U [7V(V-I C 'I) -7V(V*I c I) 
nes k ce% 

where we write c &nto mean that c is a cycle of%. 

Proof. Letx(z) =det(/+zV- 1 ) andlety(z) =det(/+zV*). Then/(z) = det ^ (z) . 
Hence, 

/<*>(*) =det(V) tQx {k - p) (z) (±) P (z). (14) 

First let us find the derivatives of \|/ and %. Let Vk y be the k x k matrix got 
from V by deleting all rows and columns except the 71, ... , jf 1 ones. 



i]/«(0) = *! £ det^ 



A] 

]\<-<jk 

£ det ^Vj*^ ( summand vanishes if j\ = 7*2) 

Uu—Jk) 

E E sgn(«)n^ 

e^) e n*w 
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The inner sum factors over cycles of 71. Let us write c e 71 to mean that c is a cycle 
of 71 and let \c\ denote the size of the cycle c. Then we may write 



i|f<*>(0)= £ sgn(7T)nTr(V*l c l). 



nes k 



Analogously, we have 



X (A) (0)= £ sgn(7c)nTr(y-l c l). 



(15) 



(16) 



To compute the derivatives of / using (fl4l) . we need the derivatives of cp := at 
0. These will be given by the sequence that we shall provisionally call {bk}- Set 
bo= 1 and for k > 1 define 



fc*= £ sgn(7r)n[-Tr(V 



*|c| ~> 



cG7t 



Then for any A; > 1 we calculate using (1151) 

£(*V,^(o) 

;=o V/ 

= E ( E ^n(^r[Tr(V* |c| )) ( 

rc[Jk] \ire5(r c ) C63I / \ 



E sgn(jc)n[-Tr(V*l c l)]). 



Fix a subset T C [k\. A permutation of T and a permutation of r c together give a 
permutation of [k] . Let 7C = c\ . . . c\ be a permutation of [£] . Then it can arise from 
summands in which T is a union (possibly empty) of some of the cycles c,s. Thus 
for k > 1 



E 

7=0 



%-;¥ (;) (0) 



E sgn(TT) £ n Tr ( y *' Cil ) El -Tr(y* |ci| 



nes k 
n=c\ ...c/ 



Bc[l]ieB ieB c 
*|c| 1 



£ sgn(7T)n Tr(rl c l)-Tr(V 
0. 



However, the equation cp • \\i = 1 implies that 



E 

,/=o 



1 if = 0. 
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It is also clear that from these equations one may inductively recover (p( fc ) (0) in 
terms of the derivatives of \|/. This shows that (pW (0) = b^. That is 



9 W (0)= £sgn(rc)n 



Tr ( V*l c l 



(17) 



Now we return to the derivatives of /. From (fl4l) . (fT6l) and (TT71) we deduce that 

/*>(o) =det(y) £ ( k )% {k - p) (0)^(0) 

= det(V) £ X |rC| (0)cp |r| (0) 

rc[t] 

= det(y) £ ( £ sgn(7i)nTr(V-l c l) ) ( £ sgn(;c) Ui-^^ 

Ta[k] \nes(T c ) cen J \nes{T) cen 

Just as before, a permutation of T and a permutation of r c together give a permu- 
tation of [k] and a permutation n E Sk can arise from summands in which T is a 
union (possibly empty) of some of the cycles of 71. Therefore 

f®{0) = det(V) £ sgn(rc) £ n Tr ( v " N )Il f-Tr(V*l c ''l) 



sc[/]*'es 



det(V) £ sgn(7c)f[ Tr(V~l c l) -Tr(V* 
nes k 



CG7I 



□ 

The probabilistic part of the theorem comes from the following lemma on 
Haar-distributed unitary matrices. 

Lemma 10. Let U be an N x N random unitary matrix sampled from the Haar 
measure. Fix n > 1. After multiplication by \/N, the first principal n x n sub- 
matrices ofU p ,p>\, converge in distribution to independent matrices with i.i.d. 
standard complex Gaussian entries. In symbols, 



N([U] lJ < n ,[U\ J < n ,...)^(G l ,G 2 ,...) 

where Gi are independent n x n matrices with i.i.d. standard complex Gaussian 
entries. More precisely, any finite number of random variables \/N[U p ]i_j, p>l, 
h j < n > converge in distribution to independent standard complex Gaussians. 
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In the literature, there are many results which are similar in spirit to LemmaflOl 
For instance, Diaconis and Shahshahani [3] computed (a slight mistake in that 
paper was corrected in Diaconis and Evans B2l) showed that if U is sampled from 

Haar measure on <ll(N), then (Tr(£/),Tr(£/ 2 ), . . .) (g h V2g 2 , . . .). Jiang [H, 
answering a question of Diaconis, proved that if p^,qN are negligible compared 
to v^V, then the entries of the principal x submatrix of a unitary random 
matrix U sampled from Haar measure on 11 (N), are approximately independent 
complex Gaussians. Our requirement is somewhere between the two. We need 
only submatrices of fixed size, but of all powers of U. We give a complete proof 
of Lemma [TOl in section [6] 

Proof of Theorem |5j Define f as in the statement of the theorem. Lemma as- 
serts that 



f W (0) =det(V) £ sgn(7T)n [Tr(V-l c 'l) -Tr(V*l c l) 
nes„ ce% 



(18) 



We want to find the limit distribution of {f^ (0) : < k < °°}. For this first let us 
consider Ti(V~ p ) -Tr(V*P) for p > 1. Setting P = [Pi : P 2 ] and Q* = [Q\ : Q* 2 ] 
where P\,Q\ are N x n matrices, from (HI) we get 



V 1 = P\A l Qi +P2Q2 and V* = PiA*Qi +P2Q2. (19) 



Then write 



Tr(V- p )-Tr(V* p ) 

= I (v-\,i 2 ...(v-%, h -(v*) ihi2 ...(v*) ip , h 

h,...,i p 

= E fl(P^-'Qi +P2Qi) lj , lj+l - n(PiA*<2i +P2Qi)i h i j+v 

Here it is implied that i p+ i = i\. Expand each of the products to get a sum of 2 P 
terms. Each of these terms is identified uniquely by an integer < r < p and a 
vector q = (qi, . . . ,q r ) of integers q\ < q2 < ■ ■ ■ < q r which are the values of j 
for which we choose (P\A~ Qi)i j} i i+l (or (PiA*<2i)i 7 -,( J+1 ), while for other j we 
choose (P2Q2)ij,i j+ i in both products. 
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A most important observation is that all summands with r = cancel. What 
remains is 



ri,...,i p r>l q jgq 



v/=l 



(20) 

We are using q to denote the vector (#1, . . . ,q r ) as well as the set {q\, . . . ,q r } but 
this should not lead to any confusion. Now write for each I = 1 , . . . , r 

n n 

{P\A~ X Q\) i i +x = E L ( P 0i 9 ,,a / (A~ 1 ) a/) p,(Qi)p /i i r 



a,=lp ; =i 
n n 



{PxA*Q{) iqviqi+l = £ E( P l)^ ; «/( A *)az,P/(Gl)p^ +1 - 

Fix a choice of r > 1, q and 0C/, |3/, 1 < / < r. Sum over i p in (|20l) . When 

we sum over ij for qi < j < q2, in both the summands corresponding to A -1 and 
A*, we get a factor of (we have displayed only those factors that depend on ij for 

qi<j< qi) 



E (e 



91+1 



( 9l+l— ( «2 



( J Pl) ;92 ,a 2 = (ei( J P2e2) Ml - 1 J Pl) Pl ,a 2 - 



Similarly we sum over for j between q\ + 1 to for every / (where r + 1 = 1). 
Write X/ = qi + i — q\ — \ for Z < r — 1 and A, r = r — q r + qi — 1. Then for a fixed 
value of r > 1, q and a/, (3/, 1 <l <r, as we sum over all ijs in (|20l) we get 



1=1 



1=1 



a-i+v 



l=l 



Any choice of (k\,...,X r ) comes from p different choices of q (by cyclically 
rotating (qi,.. .,??•))• Therefore Tr(V~ p ) — Tr(V* p ) is equal to 



"LIE 

r>\ (% u ...,%r) a t fii 



i=\ i=i 



Y[{Ql{P2Ql) h - l Pl\,a l+v 



(21) 

As before, here r+l = 1. Since PQ has Haar distribution, from Lemma [101 and 
the assumption on A, we know that 

VN(A,Q l (PQ)°P u Q l (PQ) 1 P u Q l (PQ) 2 P u ...)^(X ,G h G 2 ,...). 
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where G,, i > 1 are independent n x n matrices with i.i.d. standard complex 
Gaussian entries and independent of Xq. (Pre-multiplication by Q\ and post- 
multiplication by Pi serve to pick out the first nxn principal sub-matrix of (QP) m ). 
Now consider Q\ {PQ) m P\ . For m = 0, 

Qi(PQ)°Pi = Qi(P2Q2)°Pi 

Hence v^CM^fe) ^ ^ Gi- Next take m = 1. Since PQ = P x Qi +P2Q2, 

Qi(PQYPi=Qi(P2Q2) l Pi + (QiPi) 2 . 

From the m = case, we know that the second summand is O p (N~ l ), whence, 

VN(Qi (P 2 Q2)°Pi , Qi (P2Q2) 1 A ) (Gi , G 2 ) . Continuing inductively, for any m, 
we get Qi{QP) m Pi = Qx{PiQi) m Px + O p {N- 1 ). Thus 

>/N(A, [Qi(P 2 Q2)°Pi], [Qi(P2Q 2 ) l Pi}, ■ ■ •) ± {X , G h G 2 , ■ ■ •) 

in the sense that any finite subset of random variables on the left converge in 
distribution to the corresponding random variables on the right. 

In equation (|2T|) divide each of the r factors in the products inside the brackets 
by and multiply each factor in the product outside the brackets by v^V- A* 
itself converges to in probability and thus after dividing by y/N, in the first 
product only A -1 survives in the limit. Thus we get 



Tr(V~ p ) - Tr(V s 



r>\ (V->V) a,,(3/,Z<r/=l 1=1 



r 

-1, 



= PL I E n(V^)(a/,a /+1 ). 

r>l a/,l<rl=l 

Use this in £[8]) and observe that det(V) = det(A) det(P*<2*) = det(A)e' e where 
is uniform on [0,2rc] and independent of A. Absorb e' e into Xq and denote 
H p = X ~ 1 G P . Then we see that iVifw(O) converges in distribution to (jointly for 
k > 0, of course) 



det(X ) £ sgn(rc)n 



C H E E fl(X l G h )(a h a l+l ) (22) 

r>l (A,] X r ) Ui,l<rl=\ 

V ^>i,E^=kl / 
19 



where \c\ is the number of elements in the cycle c. We must reduce this further. 
When we completely expand the products in (1221) we see that the right hand side 
is equal to (as usual r,- + 1 = 1) 

m m ri 

£ sgnwnw ill nn^(«i' a i+i)' 

7 

The point is that many of the terms fin%.( a j! can ar ^ se f rom more than 
one permutation % and thus there is a lot of cancellation. This we investigate now. 

L 

Consider any term fT H Ml (siJi) where L > 1, hi > 1 for each / < L and 1 < 
Si, t[ < n. We compute the coefficient of such a term in (T23T) . 

L 

To organize the combinatorics that will emerge, for the term fT H^^siJi) let 

l=i 

us associate a directed multi-graph with edge-weights as follows. We assume that 

L 

£ /ui = k as only such terms can appear in (T23l) . 

The graph will have vertices {1,2, ... For each / < L, put a directed edge 
from si to and give it weight /j/. Let us also put self loops with edge- weight 
at each vertex v G {1, . . . ,n}\{siji : / < L}. Let us call this graph Q (depends on 
/jj, si etc, of course, but it would be horrifying to include that dependence in the 
notation!). We group terms together by the graph they generate and find the total 
contribution for each graph. 

The graph Q can arise from a term in (|23T) only if the edges of Q can be 
partitioned into edge-disjoint directed cycles. Note that Q is a multi-graph and 
hence if i — > j occurs twice in Q , the two instances will occur in two distinct 
cycles, but the cycles will be deemed disjoint. Also, a cycle may visit the same 
vertex more than once. 

Furthermore, each such decomposition of Q into disjoint cycles corresponds 
to some (usually more than one) choice of the permutation n in (1231) . Once 71 is 
fixed, the numbers r,- are just the sizes of cycles in this cycle decomposition of Q 
and X'j, a'j are also determined. An example is given below to elucidate the matter. 

Example 11. Suppose n = 6 and let k = 11. Suppose we look at the term 

Hi(l,3)fl2(3,2)fl2(2,l)i? 1 (2,l)H3(l,4)ff2(4,2). (24) 



20 



This term can actually arise in (1231 because the sum of the /uis is equal to k and 
the associated graph may be decomposed into disjoint cycles in two distinct ways: 
Firstly, as {(1,3,2), (1,4,2)} and secondly, as (1,3,2, 1,4,2). 

The first case, { ( 1 , 3, 2) , ( 1 , 4, 2) }, can arise from any permutation n E Si i that 
has two cycles of lengths 5 and 6 (these numbers come from adding the edge 
weights in each cycle). There are ( 5 )4!5! such permutations and they all have 
sign —1. Taking into account the weight nl c i| i n (1231 , the contribution to the 
term (l24l) from all such permutations is —11!. 

The second case, (1,3,2, 1,4,2), can arise from any 7t 6 Sn that is itself a 
cycle of length 11. There are 10! such permutations and they all have sign +1. 
Their total contribution is +11!. When put together, we see that the coefficient of 
(1241) is zero. This is no coincidence and prepares the reader for what is stated next 
in general. 

Claim: If Q can be decomposed into disjoint cycles in more than one way, then 
the coefficient of the corresponding term in (1231) is zero. On the other hand, if g 
can be decomposed in a unique way into i disjoint cycles, then the coefficient of 
the corresponding term is (— I) k\. 

Proof of the Claim: Suppose Q can be decomposed into disjoint cycles in more 
than one way. Then some vertex, say 1 without losing generality, belongs to more 
than one cycle of Q . Then in g , there are in-edges ii, . . . , ?'m leading to 1 and 
out-edges y'i , . . . , 7m leading away from 1 , for some M > 2. In any decomposition 
of g into cycles, we have the obvious matching of in-edges with out-edges, by 
associating to each in-edge the out-edge that follows it in the cycle. Consider any 
cycle decomposition of g , and suppose that in-edges i\ , ?2 are matched with j\ , 72 
respectively. We pair this cycle decomposition with a new cycle decomposition 
got by switching the matches i\ — > 71,12 — > 72 to i\ — > 72,1*2 — > 71 and leaving 
everything else intact. This leads to a pairing of all cycle decompositions of g . 
We show that the total contribution from each pair is zero. 

One can go from one cycle decomposition to its pair by splitting a cycle into 
two cycles or merging two cycles into one. Let us take the first one among 
them to have cycle sizes 0i, . . . , Qi and the second one to have cycle sizes 0i + 
02, 03, . . . , Bp. Let the sums of edge- weights along cycles in the first decompo- 
sition be w\,W2,---,W£ so that in the second cycle decomposition the sums of 
edge- weights of cycles are w\ + h>2, W3, . . . , W£. 

The permutations in Sk that respect the first cycle decomposition of g are 
precisely those with £ cycles of sizes w\ , . . . , W£. The number of such permutations 
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is 

^-n(m-l)!. (25) 
nw ; !' =1 

!=1 

t 

Each of these comes with the weight n w i m d23l) . whence the total contribution 

(=l 

of these terms is ( — \) k ~ ( k\. 

The number of permutations that respect the second cycle decomposition of 

g is 

— (wi+w 2 -l)\Yl(wi-l)l. 

(wi+w 2 )i n w { \ i=3 

i=3 

t 

Each of these comes with the weight (w\ +W2) Y[ w i m (|23l) . whence the total 

(=3 

contribution of these terms is ( — l) k ~ e+l k\. 

Thus the two cycle decompositions exactly cancel each other out and it is seen 
that the total contribution is zero. This proves the first part of the claim. 

For the second part, there is only one cycle decomposition by assumption, and 
the same calculations that led to (1251) show that the coefficient is ( — l) k ~ e kl. This 
completes the proof of the claim. 

Now consider a Q that has a unique cycle decomposition. Then that cycle 
decomposition may be regarded as a permutation x e S n , where all those vertices 
that do not occur among {siJi} are fixed points of x (this is why we added self- 
loops to all these vertices when defining g). Recall that the edge- weights of 
these self-loops is 0. It will be convenient to set Hq = I n . Observe that sgn(x) = 
(— l) n ~ e . Then using the claim above to simplify (|23l) we finally have 

tfff<*)(0) 4 (-l)*-»*!det(Xb) £ sgn(x) £ f[{H w ^ r (26) 

T£5 n (»>[,. . .,»'«) 1=1 

vv;>0,£w,=fc 

i 

Here H p = X Q l G p for p > 1 and H = /„. If we forget the factor, the 

right hand side of (|26l ) is precisely k\ times the coefficient of z k in the power 
series expansion of det(Xo + zG\ +z 2 G2 + ■■■) as may be seen by expanding the 
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determinant as 

det(Xo) £ sgn(x)f[(I + zH l +z 2 H 2 + ...) im . 

VES„ i=l 

The factor (— \) k ~ n is rendered irrelevant by changing z to — z and multiplying the 
whole function by ( — 1)". 

Thus we have proved that any the power series coefficients of iVzfjv converge 
jointly in distribution to the coefficients in the power series of det(Go + zGi + 
z 2 G2 + ■ ■ .), in the sense that any finite number of coefficients in the former con- 
verge jointly in distribution to the corresponding coefficients in the latter. This 
completes the proof of the theorem. □ 



5 Hyperbolic ensembles 

In this section we prove Theorem HI We shall make use of the following result of 
Zyczkowski and Sommers ifTTl . 

Result 12 (Zyczkowski and Sommers(2000)). Let U be an (N + n) x (N + n) 
unitary matrix sampled from Haar measure on u(N + n). Let V be the N x N 
principal sub-matrix got by deleting the first n rows and columns of U. Then the 
eigenvalues of V form a determinantal process in the unit disk D with kernel 

<fe-)=i(T 1 ) ( " 1)W 

with respect to the reference measure djj n (z) = |(1 — \z\ 2 )"~ 1 dm(z). 

Remark 13. If V is a matrix, then ^jL z y)^ is just the Blaschke product of the 
eigenvalues {Xk} of V. That is 



det(z/-V) * z-\ 



FT- - 

det(/-zV) fJi\-zh 



(27) 



That being the case, the function fjy in Theorem|4]depends only on the eigenvalues 
and not the matrix V that we choose. Why then, do we use the truncated unitary 
matrix of Result d2] instead of directly using the diagonal matrix whose entries 
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are a determinantal process with the truncated kernels? It may indeed be possi- 
ble to prove Theorem IH directly from the properties of determinantal processes 
without having to use Result [121 However, that would involve proving a bevy of 
central limit theorems (of non-linear statistics) for determinantal processes that 
can substitute Lemma [101 We do not know if that is easy. 

The advantages of the truncated unitary matrix over the diagonal matrix of its 
eigenvalues are: (1) The former is invariant under left and right multiplication by 
unitary matrices (which allows us to apply the rather easy Lemma [TOl). (2) The 
truncated unitary matrix has only n nondeterministic singular values, even as the 
matrix size goes to infinity. The cost is that we use the (far from trivial) result of 
Zyczkowski and Sommers but this has the positive value of forging a direct link 
between random matrices and random analytic functions. 

Applying Theorem \5\ to truncated unitary matrices we almost get Theorem 
but there is one snag. Theorem [5] gives convergence of coefficients in the power 
series whereas to deduce convergence of zeros, we need uniform convergence on 
compact sets. For instance, in the sequence f n (z) = n"z n , all the power series coef- 
ficients converge to zero but f n (z) does not converge for any z ^ 0. The following 
lemma, deduced directly from properties of determinantal point processes, will 
establish the required tightness, a priori. 

Lemma 14. Fix n > 1. Let {X^ : 1 < k < N}, be determinantal on the unit disk 
with kernel Kjy with respect to the measure /u n as in Result\L2\ Set 

k= l 1 — ZAk 

Then for any compact subset K 6 D, the set {f^v(z) : z<E K} is tight, uniformly in 
N. 

We postpone the proof of Lemma [141 to section [7] and proceed to prove Theo- 
rem [4] assuming the lemma. 

Proof of Theorem 31 Let U be an (N + n) x (N + n) unitary matrix and write 

U 



A C* 
B V 



where A has size n x n. By the unitarity of U, we have the following equations. 
A*A+B*B = l n and BB* + VV* = I N . 
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As BB* and B*B have the same nonzero eigenvalues, it follows that VV* has the 
same eigenvalues as A* A, except that it has N — n more eigenvalues all equal to 1. 
Thus there must exist unitary matrices P,Q E 11 (N) such that 



V 



A 






In-h 



(28) 



Now suppose U was sampled according to Haar measure on ll(N + n). Then for 
any unitary matrices Pq, Qq E 11 (N), we have 



" /„ 


" 




' A 


C* ' 




'In 





d 


' A 


C* ' 









B 


V 







Qo _ 




B 


V 



because Haar measure is invariant under left and right multiplication by group 
elements. This shows that PqVQq = V, which, together with (|28l) implies that 

Q 



V = P 



A 






In-h 



where P, Q,A are independent, P, Q are distributed according to Haar measure on 
U (N) and A is the principal nxn submatrix of an (N + n)x (N+n) unitary matrix. 

LemmaflOl show s that y/NA — > Go where Go is an n x n matrix of i.i.d. standard 
complex Gaussians. Thus Theorem [5] applies and we get 



N 1 



n det(zJ-V) 



det(I-zV* 



det £ G k z k 



where all G k , k > are i.i.d. matrices of i.i.d. standard complex Gaussians. 

This convergence is only in the sense of pointwise convergence of coefficients 
in the power series. But in case of truncated unitary matrices, Result [12] and 
Lemma [14] together strengthen it to uniform convergence on compact subsets of 
D. Therefore the zeros of % converge in distribution to the zeros of the limiting 
analytic function. 

The upshot is that the point process of eigenvalues of V (which are exactly the 
zeros of f/v) converge in distribution to the zeros of det(Go + zGi +z 2 G2 + ...). 
Use the result of Zyczkowski and Sommers and let iV — > °°. The kernels Kjy 
increase (in the sense of operators, i.e., the associated Hilbert spaces increase) to 
the kernel K(z, w) = (1 — zw)~ n ~ l . From the facts stated after definition [FT] in the 
appendix,it follows that the determinantal process with kernel Kyy converges to 
the determinantal process with kernel K and the proof is complete. □ 
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6 Proof of Lemma 



10 



In proving Lemma [TOl we shall make use of the following result on the joint 
moments of entries of a unitary matrix from the book of Nica and Speicher lfT2l . 
page 381 (we state a weaker form suited to our purpose). 

Result 15. Let U = ((uij))ij<N be chosen from Haar measure on u(N). Let 
k<N and fix i(£) , ;(£) , i'(£) , )'(£) for 1 < i < k. Then 



E 



k k 



where Wg (called "Weingarten function") has the property that as N — > °°, 

Wz(N x) - I N ~ k + °( N ~ k ~ 1 ) ^ = e ("identity"). 
8lV,XJ "\ 0(N- k - 1 ) ifx^e. 

Proof of Lemma [M We want to show that VN(U k ) a ^, k>l, l<a,P<n con- 
verge (jointly) in distribution to independent standard complex Gaussians. To use 
the method of moments consider two finite products of these random variables 

m m' 

s=m uk %,^ pi and t =yvsu%^. 

i=\ (=1 

where m,m',pi,p' i ,ki,kf i > 1 and 1 < a,, (3 ; , a^,P' < n are fixed. We want to find 
E[ST] asymptotically as Af — > °°. 

The idea is simple-minded. We expand each (U ) a p as a sum of products of 
entries of U. Then we get a huge sum of products and we evaluate the expectation 
of each product using Result [15] Among the summands that do not vanish, most 
have the same contribution and the rest are negligible. We now delve into the 
details. 

Let (Pk(dL, |3) denote all "paths" y of length k connecting a to p\ This just means 
that y e [N] k+l , y( 1 ) = a and y(k + 1 ) = |3. Then we write 

k 

(tf*)a,p= E ri M Y0 , ),r0 , +i)- 
Expanding each factor in the definition of S like this, we get 

m Pi kj 

s = e nnri''r / v / 1 

i<m;t<pi 
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E 



In words, we are summing over a packet of p\ paths of length k\ from oci to Pi, a 
packet of p2 paths of length ki from 0C2 to P2, etc. T may similarly be expanded 
as 

m! p'i *{ 

t = e nnn M rfa),rfo-+D- 

To evaluate E[S7], for each pair of collections y = {yf} and T = {rf}, we must 
find 

l Pi h \ frri P'i K y 

uuu u ium+D nnn"rfo),rf( 7+ i) • 

K i =u= i j= \ J \ i= u=\j=\ J _ 

Fix a collection of packets yv 6 ^(otj, P ; ). For which collections r| G ^(ocj, P;) 
does (|29l) give a nonzero answer? For that to happen, the number of uus and the 

number of wys inside the expectation must be the same (because e' Q U = U for 
any 9g1). Assume that this is the case. 

It will be convenient to write y(i, £, j) in pace of yf (j) ■ From Result[T5l to get 
a nonzero answer in (l29l) we must have bijections 

{(i,£,j) :i<m,£< p u 1 < j < ki] $ {(i,£,j) :i<m',£< p' h l<j< kft, 
{(i,£j):i<m,£<p l ,2<j<k l + l}^{(i,£J):i<m',e<p' h 2<j<k' i + l}, 

such that 

(Y(Mj j))i<m,l<Pi,\<i<ki = {r{n{h£J)))i<mJ<pi,l<j<ki • 
(y(h£J))i<m,£<pi,2<j<kj+l = (^( (''^J)))i<m,£<p i ,2<j<k i +\ 1 

And for each such pair of bijections n, a, we get a contribution of Wg(A r , 

Let us call the collection of packets y typical, if all the paths yv are pair- 
wise disjoint (except possibly at the initial and final points) and also non self- 
intersecting (again, if a,- = P ; , the paths in packet i intersect themselves, but only 
at the end points). 

If y is typical, then it is clear that for T to yield a nonzero contribution, T 
must consist of exactly the same paths as y. This forces ki = k\ and pi = p\ and 
0C; = OC;,P, = Py for every i. If this is so, then the only pairs of bijections (tc,g) 
that yield a non zero contribution are those for which 

• 71 = o (From the disjointness of the paths). 
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k 

• % permutes each packet of paths among itself. In particular there are n Pi ] - 

i=\ 

such permutations. 
This shows that for a typical y, the expectation in (l29l) is equal to 

lT=y^f[pi^jWg(N,e). (30) 

Here y = T means that the two sets of paths are the same. Now suppose y is 
atypical. For any fixed y, typical or atypical, the number of T for which (|29b is 
nonzero is clearly bounded uniformly by m and pi,k{, i < m. In particular it is 
independent of N. Therefore the expected value in < f29b is bounded in absolute 
value by 

CsupWg(A^,x). (31) 

Now for an atypical y, at least two of yf(j), I < i < m, 1 < £ < pi, 2 < j < ku 
must be equal (our definition of "typical" did not impose any condition on the 
initial and final points of the paths, which are anyway fixed throughout). Thus, 
if we set r = pi(k\ — i) + ... + p m {k m — 1), then it follows that the total number 
of atypical y is less than r 2 N r ~ l . Since the total number of y is precisely _/V r , 
this also tells us that there are at least N r — r 2 N r ~ l typical y. Put these counts 
together with the contributions of each typical and atypical path, as given in (l30l) 
and (|3TT) . respectively. Note that we get nonzero contribution from typical paths 
only if S = T. Also, the total number of factors in S is r + LP; (this is tne i n 
Result [H]). Hence 

m 

E[ST\ = l s=T N r (\-0(l/N))Wg(N,e)Y[pS. + 0(N r - 1 ) sup Wg(^V,x) 

i=l I^Sr+Zpt 

= i ^- Epi (n^'J( i +°(^)) 

by virtue of the asymptotics of the Weingarten function, as given in Result [T5I 

The factor N^ Pi is precisely compensated for, once we scale (U ) a r by ^/N, as 
in the statement of the lemma. Since the moments of standard complex Gaussian 
are easily seen to be E[g p g q ] = p\l p=q , we have shown that VN(U k ) a ^, k > 1, 
a, P < n, converge to independent standard complex Gaussians. □ 
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7 Proof of Lemma 14 



We prove Lemma [14] in this section. We shall make use of the following fact, 
which is a direct consequence of (61, Theorem 26. 

Result 16. Fix n > 0. Let {k\, . . . ,?Wv} be determinantal on the unit disk with 
kernel 

N-l 

\kk—k 



N ~ l f-n - 1\ 



with respect to the background measure d/u n (z) = ^(1 — \z\ 2 ) n ~ 1 dm(z). Then the 
set {|Xyt| 2 : 1 < k < N} has the same distribution as {Y^ : <k < N — 1}, where 
YkS are independent random variables and Yt has distribution Beta(& + l,n). 

As a consequence of this result, it is very easy to see that A^f^O) is tight. 
For, 

E[M0)| 2 ] = A^nV,] 

k=0 

= N n n 



k=Q n + k+l 

AT 



(N+l)(N+2)...(N+n) 
n\ 



as N — > °°. For z 7^ it is not as simple, because for finite Af, the distribution of 
{| I } is not the same as that of a set of independent random variables. In the 
N — > 00 limit, it is, but is of no use to us. 

Proof of Lemma H Write <p z (X) = f^. Write 

£ log |cpz(^)| 2 = - f (i - Mh) I 2 ) + 1 W 

k=\ k=\ k=\ 

where h z {X) : = log |cp,(X)| 2 + (1 — |(p z (X)| 2 ). The lemma will be proved by show- 
ing that the following are tight (uniformly over z in compact sets, as N varies). 

N 

• I h z (l k ). 

k=l 
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N 



Let us consider them one by one. 



-n\ogN + E (1 



12 > I 
^ 2" 



A 7 

1 . Consider £ h z (Kk) . For each z G D, the function (f> z maps the unit disk onto 

Jfc=l 

itself and the unit circle onto itself. Therefore, given a compact set K, we 
may find T < 1 such that for any z E K and |A,| > T we have |cp z (X)| 
From the power series expansion of log(l — x), we then get 

\h z (l)\ < 2(1 - |(PzW| 2 ) 2 for z G K, \l\ > T. 

(l-\z\ 2 )(l-\X\ 2 ) 



Observing that 



\l-zk\ : 



it follows that 



A 7 



<2 £ \h z (X k )\+c £ (l-fo 
M<r |A*|>r 



|2\2 



for a constant C (does not depend on N or z, as long as z G A"). The first 
summand is tight because the set : |^| < T} converges to the set of 
points in T~B in the limiting determinantal process. The second summand 
may be stochastically bounded by YfelaO- ~Yk) 2 by Result [T6l From the 
explicit distribution of Y^s, we may compute the expected value of this sum 
as 

n(n+l) 



E 



N-l 
k=0 



N-l 



k=0 



{n + k+l)(n + k + 2)' 



The random variables on the left are stochastically increasing in N and 
hence, and hence, a uniform bound on the expectations shows tightness. 



A 7 



2. Consider —n\ogN+ £ (1 — |(p z (A,^) | ). We shall show tightness by proving 

k=l 

that the expected value and variance of these random variables are bounded 
uniformly in N (and z G K). 
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Expected value: As always, it is simpler for us to deal with \kk\ 2 - There- 
fore, write 



1-|<PzMI 



(l-|z| 2 )(l_-W 2 ) 
\l-zk\ 2 

CO 

= (l-|z| 2 )(l-|X| 2 ) £ zWX p . 

p,q=0 

Set X = Xk, sum over k and take expectations. Any term with p^q vanishes, 
because of rotation invariance of For p = q, we get terms with \kk\ 2 
which may be replaced by independent Beta random variables by Result [T6l 
Thus 



E 



N 



£(l-|q>z(A* 



k=\ 



:i-N 2 )Ekl 2p E 

p=0 



A? 



£(i-|a*fW 



/i I |2\ V I |2pf Beta(fc + ;?,«) -Beta(£ + p + l,n) 
~ p^o jfel Beta(M 

n | ,2xf , ,2 P f n*(fc+l)-(fc + n-l) 
p=0 S (* + p).-(* + P + n) 

A 7 



(l-N 2 )LW 2p l(^ + err, 



p=0 fc=l 

where |err^| < C(p)k~ 2 where C(p) is at most a polynomial in p. Thus, it 
follows that 



E 



N 

I 

k=l 



I(1-|CP Z (^)| 2 ) 



(nlogA0(l-|z| 2 ) EN 2p |+0(l) 

\p=0 

/zlogiV + 0(l). 



This is exactly what we wanted to show about the expected value. 



N 



Variance : Now we want the variance of := £ (1 — |(p z (A#)| ). No 
really new ideas are needed, only the calculations are more tedious. Expand 
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X N in power series as before to get 



E[X 2 ]= £ z p+r z q+s E 

P,q,r,s>0 



N 



(i-N 2 )(i-|^| 2 ) £ tyityl 

kl=\ 



(32) 



All terms in which p + r ^ q + s vanish, by rotation invariance. Fix p, q, r, s 
so that p + q = r + s and write the inner expectation as 



E 



N 

I 



£(l-|^| 2 ) 2 |^| 2p+2r 



+ E 



Z(i-fa\ 2 )(i-\u\ 2 )mw-i 



The first one is already a function of the absolute values of X^s and hence 
replacing them by independent Beta random variables, we get (details are 
similar to those in computing the expectation) 



E 



N 



N 



2\2|i |2p+2r 



£(i-\l k \ 2 ) 2 \X k 

k=\ 

Beta(£ + p + r, n) - 2Beta(£ + p + r + 1 , n) + Beta(£ + p + r + 2, n) 



k=\ 



N 



Beta(£,n) 
k(k+l)...(k + n-l) 



= n(n + 1) > - 

k % (k + p + r)...(k + p + r + n + l) 

which is bounded because the summand is of order k~ 2 . Of course, we shall 
have to sum over p, q, r, s, but it is clear that because of the factor of |z| 2/7+2r 
in (1321 ), the total contribution to (1321 ) from this summand (all terms with 
k = £) is bounded as N — > °°. 

It remains to consider the sum over k ^ £. The two point correlation is 

K N (X,X)K N (^) - K N (l,QK N (^,X). We consider 



E 



£(i-|^l 2 )(i-N 2 )^M 

k+i 



= (i 



)(i - |4| 2 )^'k'(k„(X,x)k w (4,4) - IM«)| 2 ) ■ 



Consider the first summand, where we choose the term Kjv(A,,A,)Kjv(^,^) 
inside the brackets. This survives only if p = q and r = s and it is easily seen 
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that this term when summed over p = q and r = s in (1321 ) will give exactly 
E[X] 2 . When we compute the variance of X, we shall subtract E[X] 2 from 
E[X 2 ] and this term gets cancelled. 

Thus to show boundedness of the variance, we only need to show the bound- 
edness of 

(1 - \l\ 2 )(l - |^| 2 )^X p ^f I^(X^)K^(t ^^(A,)^^). (33) 
Recall that 

where Cj = (""r 1 ) (-1)- 7 '. Also, for any j, 

Cj J \X\ 2 Jdp n (X) = l. 

D 

Therefore the integral in d3~3l is equal to 



£ QQS?+*,.p+A+./>+i 1 

i,j=0 \ L P+j L P+j+l 



1 1 



C r +i Cr+i+1 



We fixed p, q, r, 5 such that p — q = s — r. Therefore, there are N — \p — q\ 
choices for (z, j) which do not vanish (if \p — q\ > N, there are no such 
terms). Without losing generality, let p < q, and write the above quantity as 

T'c, c „-, (- l —) (- l - 

N~\p-q\ n 2 

§ 0-+i)0-+i+^- P ) +c>(1) 

by writing out the expressions for C ; s. Thus, this term, when summed over 
p, q, r, s yields a bounded quantity. 
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In summary, we wrote E [X 2 ] as in (|32l) . Terms with k = £ yielded a bounded 
quantity. Terms with k ^ £ were split into two sums. One of them is bounded 
while the other is exactly equal to E[X] 2 . Thus the variance is bounded as 
N — > oo. 

This completes the proof of Lemma [14] □ 

8 Concluding remarks 

We record here two among several natural questions that arise from the consider- 
ations of this paper. 

1 . In TheoremSl the determinantal process exists for any positive real n, whereas 
the matrix analytic function makes sense only for integer values of n (size 
of the matrix!). Is there a random zero set interpretation for hyperbolic 
determinantal processes for non-integer values of nt 

2. Are there random matrices of the form given in Theorem[5]for which we can 
calculate the exact distribution of eigenvalues? Recall that these are random 
matrices for which: (1) The distribution is invariant under multiplication by 
unitary matrices. (2) The number of random singular values stays fixed even 
as the size of the matrix goes to infinity. 

If this can be done, then presumably we shall also get the distribution of 
zeros of det(Xo + zG{ +z 2 G2 + ■■■) (we say "presumably" because we have 
not proved uniform convergence of f# on compact sets except in the the 
special case of truncated unitary matrices). 

Acknowledgements: I thank Balint Virag for asking me the question of finding 
determinantal processes on the sphere, Yuval Peres for innumerable illuminat- 
ing discussions and for great encouragement throughout the project, and Mikhail 
Sodin for suggesting the use of Wick calculus which led to a verification of two- 
point correlations in IfTOl and convinced us that Theorem |4] must be true. I am 
greatly indebted to Brian Rider for pointing out the paper of Zyczkowski and 
Sommers which finally enabled me to prove Theorem 0] 

9 Appendix: Determinantal point processes 

We give a brief introduction to determinantal processes, strictly limited to the con- 
text of this paper. More details, as well as proofs, may be found in the surveys [16] 
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or©. 

Let Q. be a region in the plane and let p be a positive continuous function on 
£1. Define the measure /j by d/j(z) = p{z)dm(z). A simple point process on Q. is 
a random measure on Q. that takes values in counting measures on £1 and gives 
finite measure to compact sets. If the number of points in any compact set has 
exponential tails, then the distribution of the point process is determined by its 
correlation functions (joint intensities) with respect to /j, 

, . V\x has points in each of D(zi, e), 1 < i < k) 
Pk(zi, ...,z k )= hm 

7=1 

for any k > 1 and any zi, ■ ■ ■ ,Z/t E ^- There is also an integral version of this 
definition which is more appropriate in more general situations. Joint intensities 
need not exist in general. 

Consider the space L 2 (£1,/j) and its subspace H consisting of holomorphic 
functions. It is a fact that H is a closed subspace. For, suppose f n E H and 
fn — * / m L (n). Then for any z E £1, consider a disk D(z,r) contained entirely 
in £1. Convergence in L 2 and absolute continuity of /j shows that f„s converge to 
/ in L 2 (m) on the annulus D(z, r)\D(z, §). Apply Cauchy's integral formula to 
infer that /„ — > f uniformly on D(z, ^). Therefore / is itself analytic on Q. This 
shows that H is a Hilbert space. This reasoning also shows that for any zE Q., the 
evaluation / — > f(z) is a bounded linear functional on H. 

Switch notations and let H denote any closed subspace of L 2 (/u) consisting 
of holomorphic functions (not necessarily all holomorphic functions). Then the 
evaluation / — > f(z) is a bounded linear functional on H. As a consequence, if 
{V'i}«>i is any orthonormal basis of H, then the series 

^(z 5 w)=£¥n(^)¥„(w), 

does converge and K (called the reproducing kernel of H) is independent of the 
choice of the basis. The integral operator on L 2 (/j) defined by 

ac/(z) = / f(w)K(z,w)dfi(w) 
h 

is precisely the projection operator on L 2 (/j) onto the subspace H. 
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Definition 17. Let H be a closed subspace of L 2 (/j) consisting of analytic func- 
tions, and let IK be the reproducing kernel of H. Then a point process X onQ. with 
joint intensities given by 

P*(zi>- ■■,Zk)= det (K(zi,Zj)) iJ£k 

does exist and is called the determinantal point process on Q. with kernel K with 
respect to the measure /j. We also say that H is the associated Hilbert space. 

We just state a few facts regarding these processes. 

1. The number of points in X is almost surely equal to the dimension of H. 

2. If subspaces H„ increase to H, then the corresponding determinantal pro- 
cesses converge in distribution, i.e., X n — > X. This was used tacitly in de- 
ducing Theorem |4] from Result [T2l 

3. If H = span{l,z, . . . ,z n ~ 1 }, then by writing out the density (which is just 
^p„(-)), one sees that the vector of n points of the the process has density 

n 

proportional to TT \zi — Zj\ Yl p(zi) with respect to the Lebesgue measure 

i<j (=1 

on Q! 1 . 
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